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1.  Introduction. 

Let  G be  a graph  on  the  set  of  vertices  V , where  ||V(|  = n . 

We -shall  write  u — v if  vertex  u is  adjacent  to  vertex  v in  G , 

and  u — H v if  they  are  not  adjacent.  A layout  of  G is  a one-to-one 
mapping  f that  takes  V into  the  positive  integers;  equivalently,  a 
layout  can  be  regarded  as  a string  of  vertices  and  "blanks",  with  each 
vertex  of  V appearing  exactly  once,  for  instance  b_c__da  . The 
correspondence  between  these  two  definitions  is  simply  that  f(v)  = k 
if  and  only  if  v is  the  k-th  element  of  the  string;  thus 
b_c__da  corresponds  to  f(a)  = 7 > f(b)  = 1 , f(c)  = 3 > f(d)  = 6 , 
where  V = {a,b,  c,  d}. 

The  bandwidth  of  a layout  f is  defined  to  be 

bandwidth(f)  = max{ | f (u)-f (v) [ : u — v}  , 

the  greatest  distance  between  G-adjacent  vertices  in  the  string  corresponding 
to  f . The  bandwidth  of  graph  G is  then 

Bandwidth(G)  = min {bandwidth (f ) : f is  a layout  of  G}  . 

It  is  clear  that 

Bandwidth(G)  = max{Bandwidth(G' ) : G'  is  a connected  component  of  G]  , 

for  if  f is  any  layout  there  is  another  layout  f'  , having  the  same 
bandwidth,  in  which  the  connected  components  of  G appear  "unmixed"  as 
substrings.  (We  can  let  f ' (v)  = f(v)+Nc(v)  , for  example,  where  c(v) 
is  the  number  of  the  component  containing  v , and  where  N is  sufficiently 
large. ) 

Perhaps  the  most  important  application  of  the  bandwidth  notion  arises 
in  connection  with  sparse  matrices.  Given  a sparse  nxn  matrix 
A = (a. .)  , let  G be  the  graph  on  vertices  [v, ,...,v  } where  v.  — v. 
for  i ^ j if  and  only  if  a. . ^ 0 or  a..  ^ 0 . Then  Bandwidth (G)  < k 

1 J J -A- 

if  and  only  if  there  is  a permutation  matrix  P such  that  all  elements 
T 

of  P AP  lie  on  the  diagonal  or  on  one  of  the  first  k superdiagonals 
or  the  first  k subdiagonals.  This  is  easily  proved  by  observing  that 
blanks  may  be  removed  from  a layout  without  increasing  the  bandwidth. 


When  G has  no  edges,  its  bandwidth  is  trivially  -o=  . 

Otherwise  the  bandwidth  will  be  as  low  as  1 if  and  only  if  each  component 
of  G is  an  isolated  point  or  a path,  namely  a subgraph  of  the  form 
v1  — — ...  — vn  , where  v — v.  iff  |i-j|  = 1 . It  is  easy  to 

determine  whether  or  not  Bandwidth(G)  = 1 , even  when  G is  not  known 
to  be  connected,  in  linear  time;  in  other  words,  there  is  an  algorithm 
which  decides  in  0(n)  steps  whether  or  not  a sparse  matrix  can  be 
converted  into  tridiagonal  form  by  simultaneous  row  and  column  permutations. 
(See  [13].)  The  simplicity  of  this  algorithm  suggests  naturally 
that  the  next  harder  case  might  not  be  too  difficult,  and  indeed  we  shall 
see  below  that  the  condition  Bandwidth (G)  = 2 can  be  tested  in  linear 
time.  However,  the  algorithm  which  achieves  this  is  quite  intricate, 
and  there  appears  to  be  no  elegant  way  to  characterize  graphs  of 
bandwidth  2 . 

The  authors  have  been  unable  to  construct  a polynomial-time  algorithm 
that  decides  whether  or  not  Bandwidth (G)  = 3 • The  bandwidth  2 case 
indicates  some  of  the  difficulties  which  must  be  surmounted.  Section  8 
below  shows  that  the  general  problem  of  deciding  whether  or  not 
Bandwidth(G)  < k , given  k , is  NP-complete,  even  if  G is  a free  tree 
with  all  vertices  of  degree  < 3 . This  restriction  to  trees  is  of  special 
interest  because  the  analogous  problem  of  minimizing  £ jf(u)  - f(v)|  instead 
of  max|f(u)  - f (v) J over  all  layouts  can  be  done  in  polynomial  time  when  the 
graph  is  a free  tree  [31],  yet  it  is  NP-complete  for  general  graphs  [17]. 

Section  9 considers  the  analogous  problems  which  arise  when  acyclic 
directed  graphs  replace  undirected  graphs.  Several  open  problems  conclude 
the  paper. 


3 


2.  Preliminaries  for  the  Algorithm. 

In  this  section  we  shall  begin  to  develop  an  algorithm  that  tests 
whether  or  not  Bandwidth(G)  = 2 . We  shall  assume  that  G is  connected 
and  that  it  has  at  least  one  vertex  of  degree  > 3 . (if  all  vertices 
are  of  degree  < 2 , it  is  easy  to  see  that  Bandwidth(G)  < 2 , since  such 
a graph  is  a collection  of  isolated  points,  paths,  and  cycles.)  The 
connectedness  assumption  implies  that  G has  at  least  n-1  edges,  and 
on  the  other  hand  we  may  assume  that  G has  at  most  2n-3  edges  since 
a graph  of  bandwidth  k cannot  have  more  than  (n-1)  + (n-2)  + ...  + (n-k) 
pairs  of  adjacent  vertices.  Therefore  our  algorithm  will  take  0(n) 
steps  if  its  running  time  is  bounded  by  a constant  times  the  number  of 
edges  in  G . 

In  order  to  get  into  the  right  frame  of  mind  for  this  problem,  the 
reader  is  urged  to  try  his  or  her  hand  at  finding  a bandwidth-2  layout 
for  the  graph  in  Figure  1.  Like  all  graphs  of  bandwidth  2 , this  one  is 
rather  "skinny";  a breadth-first  search  will  not  involve  many  unexplored 
nodes  at  any  time.  The  puzzle  which  the  reader  is  now  asked  to  try  is 
simply  this:  Arrange  the  27  vertices  of  Figure  1 into  a straight  line 

so  that  all  pairs  of  vertices  which  are  directly  linked  in  that  graph  are 
separated  by  at  most  one  other  vertex  in  the  line.  (This  puzzle  is  not 
quite  so  easy  as  it  looks.  The  algorithm  we  shall  develop  is  supposed 
to  work  in  linear  time,  essentially  without  backing  up,  but  no  such 
restriction  is  being  imposed  on  the  reader.) 


A B G H N Q T U V W 
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Figure  1.  Example  of  a graph  which  the  reader  is  urged  to  arrange 
into  a bandwidth-2  layout  before  proceeding  further. 
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Perhaps  the  most  important  notion  which  arises  in  connection  with 
graphs  of  bandwidth  2 is  the  concept  of  chains  within  G . We  say 
that  v begins  a chain  of  length  k if  there  are  vertices  v = v^,  ...,vk 
such  that 

V1  ~ V2  ~ * * ' ~ Vk 

in  G , and  each  of  v^,  ...,vk  p has  degree  2 ; furthermore  v^  must 
be  of  degree  1 , an  endpoint. 

Let  us  define  f(v)  =1  if  deg(v)  - 1 , and  f(v)  = k+1  if 
deg(v)  = 2 and  v — w where  l( w)  = k ; otherwise  l(v)  = co  . This 
function  is  well-defined  since  Bandwi.dth(G)  > 1 ; and  it  is  clearly 
possible  to  compute  f (v ) , for  all  v > in  0(n)  steps.  Therefore 
our  algorithm  will  assume  that  this  precomputation  has  been  carried  out. 

The  values  of  l for  the  example  graph  in  Figure  1 are  shown  in 

Figure  2.  Note  that  vertex  v is  part  of  a chain  if  and  only  if  f(v)  < «>  . 

~i  i I m\-1 

CO  2 2 00  <0  co 

\l  III  l\l 

00  ■■■—  00  — - 00  CO  — CO  1 CO 

Figure  2.  The  i function  for  the  example  graph  in  Figure  1; 
there  are  three  chains  of  length  2 . 


We  shall  say  that  a layout  f is  chain- stretched  if  | f (v^ )-f (v^+^) | = 2 
whenever  v^  and  v^+-^  are  consecutive  vertices  of  a chain.  This 
terminology  is  justified  because  of  the  following  observation. 


Lemma.  Every  graph  of  bandwidth  < 2 has  a chain-stretched  layout  of 
bandwidth  < 2 . 

Proof.  Let  f be  a layout  for  the  graph  G , where  Bandwidth (g)  < 2 ; 
we  may  assume  that  G is  connected.  Furthermore  we  shall  choose  f to 
have  the  maximum  "range  span"  over  all  bandwidth-2  layouts  for  G ; i.e., 


max  „ f(v)  - min  „ f(v)  is  to  be  maximum  over  all  f with 
veV  veV  v 

bandwidth(f)  < 2 . (The  maximum  range  span  is  finite,  at  most  2n-2  , 

since  G is  connected.)  We  shall  prove  that  f is  chain- stretched. 

If  not,  the  string  cp  corresponding  to  f contains  the  substring 

uv  , where  u and  v are  consecutive  vertices  of  a chain.  By  definition 

deg(u)  and  deg(v)  are  at  most  2 , and  u — v , hence  u and  v are 

each  adjacent  to  at  most  one  other  vertex.  By  maximality  of  f ’ s range 

span,  the  strings  obtained  from  cp  by  replacing  uv  by  u_v  and  v_u 

are  not  layouts  of  bandwidth  < 2 . It  follows  that  cp  contains  the 

substring  uvab  or  abuv  , where  a — u — v — b ; by  left-right 

symmetry  we  may  assume  that  9 contains  abuv  . Then  v must  be  the 

rightmost  nonblank  element  of  cp  . If  l(u.)  > £(v)  = k , graph  G 

contains  the  chain  v,  — v,  , — ...  — vn  where  v = v and  b = v.  , 

k k-1  1 k k-1 

but  then  cp  must  end  with  v^u^  • • • uk-lvk-lUkVk  ^ can  be 

lengthened  by  replacing  this  substring  by  u^  . . . u^  ^.-^k  vk_vk  1 *"_vi 

On  the  other  hand  if  f(v)  > f(u)  = k , a similar  argument  shows  that 
cp  ends  with  u^Ug  . . . u^  jvk_puil:v^  where  u = u^  and  a = u^^  , and 

this  substring  can  be  replaced  by  _vp_- ♦ ♦_vjc_ujc_ujc_p. • • *_ui  • toth 

cases  the  maximality  of  range  span  has  been  contradicted.  □ 

The  algorithm  we  shall  develop  below  is  based  on  a subalgorithm 
which  solves  the  following  problem:  "Given  a connected  graph  G and  two 
vertices  a and  b , decide  whether  or  not  there  exists  a layout  f of 
bandwidth  < 2 such  that  f(a)  = 1 and  f(b)  = 2 ."  If  such  a layout 
beginning  with  ab  exists,  the  algorithm  will  construct  one;  and  in  all 
cases  the  algorithm  will  terminate  after  0(n)  steps.  The  idea  is  to 
build  the  layout  step  by  step,  working  with  partial  layouts,  namely  with 
one-to-one  functions  f that  are  defined  only  on  a subset  of  the  vertices 
All  partial  layouts  we  shall  deal  with  will  satisfy  the  bandwidth  2 
condition,  in  the  sense  that  |f(u)  -f(v)|  < 2 whenever  f(u)  , f(v)  are 
both  defined  and  u — v . Furthermore  we  know  by  the  lemma  that  it 
suffices  to  restrict  attention  to  chain- stretched  partial  layouts. 
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If  f is  a partial  layout  defined  on  the  set  of  vertices  U , 
the  active  vertices  of  f are  those  elements  ueU  such  that  u — v 
for  some  v^U  . If  f^  is  a partial  layout  defined  on  and  fg 

is  a partial  layout  defined  on  V2  j Vx  , we  say  that  f2  is  an  extension 
of  f^  if  fg(v)  = f-^(v)  for  all  veV^  . We  also  say  that  f is  a 
complete  layout  if  = V and  bandwidth (f0)  < 2 . Thus  the  task  of  our 
sub algorithm  will  be  to  decide  whether  or  not  the  partial  layout  f 
defined  by  the  string  ab  (i.e.,  f(a)  = 1 and  f(b)  = 2 ) can  be 
extended  to  a complete  layout. 

The  subalgorithm  actually  does  more,  since  its  initial  task  leads 
to  a family  of  similar  subtasks  of  three  types: 

Type  A.  Given  a partial  layout  defined  by  the  string  a ab  , where  at 
most  a and  b are  active,  can  it  be  extended  to  a 
complete  layout? 

Type  B.  Given  two  partial  layouts  defined  by  the  strings 

aa.  b ...a,b,  and  ab  a . ..b, a,  , for  some  m > 1 , 
mm  11  mm  lx  ’ — 

where  at  most  a^  and  b^  are  active,  can  at  least  one 

of  these  be  extended  to  a complete  layout? 

Type  C.  Given  a partial  layout  defined  by  the  string  qp  = a_affl 
for  some  m > 1 , where  at  most  a.  is  active,  can  it 
be  extended  to  a complete  layout? 

In  each  case  a is  a (possibly  empty)  initial  string  which  has  no  important 
influence  on  the  algorithm,  since  it  represents  inactive  vertices  and  blanks 
that  have  already  been  permanently  placed.  The  string  a in  tasks  of  Type  C 
will  have  length  > 2 , and  its  final  two  elements  will  be  nonblank.  The  two 
strings  in  tasks  of  Type  B will  be  denoted  by  cp  = a(ambm>  ...  (a^b^)  . 

The  idea  of  the  sub algorithm  is  quite  simple,  namely  to  "keep  doing 
something  useful."  Let  f be  a partial  layout  of  one  of  the  three  types, 
defined  on  the  vertices  U . (Actually  f represents  two  partial 
layouts  if  it  is  of  Type  B,  but  it  will  be  convenient  to  ignore  this 
fine  distinction  in  our  informal  discussion. ) By  looking  at  how  the 
active  vertices  of  f interact  with  vertices  / U,  it  may  be  obvious 
that  f cannot  be  completed.  Otherwise  the  subalgorithm  will  find  a 
sufficiently  general  extension  of  f , namely  an  extension  layout  f 
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which  can  be  completed  whenever  f can  be;  and  f'  will  have  one  of  the 
three  basic  types.  If  any  suitable  extension  is  found,  the  string  cp 
corresponding  to  f will  be  replaced  by  the  string  cp'  corresponding 
to  f ' , and  the  process  will  continue  until  either  reaching  an  impasse 
or  a complete  layout.  The  running  time  for  each  extension  step  will  be 
bounded,  except  in  one  case  where  the  running  time  can  be  "charged"  to 
subsequent  extension  steps;  hence  the  total  time  will  be  0(n)  . 

In  Section  7 we  shall  show  how  the  subalgorithm  can  be  used  to 
construct  an  algorithm  that  solves  the  general  bandwidth  2 problem 
(without  any  given  partial  layout),  in  linear  time. 
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The  Subalgorithm  for  Types  A and  B. 


We  shall  present  the  subalgorithm  informally,  with  proofs  of  the 
validity  of  each  extension  intermixed  with  specifications  of  the  actual 
operations  to  be  carried  out.  The  actions  will  be  of  three  kinds: 

(a)  Terminate  successfully  because  cp  is  complete;  (b)  terminate 
unsuccessfully  because  cp  cannot  be  completed;  (c)  set  cp'  to  a 
sufficiently  general  extension  of  cp  . It  is  hoped  that  this  manner  of 
presenting  the  procedure  will  make  it  easy  to  understand  and  reasonably 
enjoyable  to  read.  Examples  of  the  sub  algorithm  in  operation  appear  in 
Section  6 below. 

The  following  notation  will  be  used  for  convenience: 

U = set  of  vertices  appearing  in  cp  = domain  of  current 
partial  layout  f ; 

S(u)  = [v  | u — v and  v/u}  ~ "successors"  of  vertex  u ; 
n(u)  = j[S(u)jj  = number  of  "successors"  of  u ; 

.?(u)  = chain  level  of  u (defined  earlier). 

It  is  clearly  possible  to  build  and  maintain  data  structures  so 
that  references  to  S(u)  , n(u)  , £(u)  take  a bounded  amount  of  time. 

The  subalgorithm  consists  of  a long  but  exhaustive  list  of  cases  covering 
which  actions  are  appropriate  under  various  circumstances  that  can  arise. 

First  let  us  consider  Type  A,  recalling  that  tasks  of  this  type  are 
specified  by  the  string  cp  = crab  , where  at  most  a and  b are  active. 


Case  Al,  n(a)  >1  or  n(b)  > 2 . Failure. 

Case  A2,  n(a)  = 1 . Set  cp'  = aabc  where  S(a)  = [c } . 

Case  A3,  n(a)  = 0 , n(b)  = 2 . Set  cp'  = aab(cd)  where  s(b)  = {c,  d} 

Case  a4,  n(a)  = 0 , n(b)  = 1 . Set  <p'  = aab_c  where  S(b)  = {c]  . 

Case  A5,  n(a)  = 0 , n(b)  = 0 . Success. 

Note  that  Cases  A2,  A3,  A4  lead  to  new  problems  of  Type  A,  B,  C respectively; 

the  proofs  of  validity  in  each  case  are  trivial. 
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Recall  that  tasks  of  Type  B are  specified  by  the  string 

<p  = a (a.  b ) ...  (a,bn)  , for  some  m > 1 , where  at  most  a,  and  b, 
v N m m'  ' 1 1'  — 1 1 

axe  active.  Actually  cp  represents  a potential  Choice  between  two 

partial  layouts,  aa  b . ,.a,b..  and  ab  & . ..b.,a,  . For  convenience 
x mm  11  mm  Xl 

we  shall  write  a = , b = b^  ; we  may  assume  by  symmetry  that 

n(a)  < n(b)  . 

Case  Bl,  ,jS(a)  IJ  S(b)jj  >2  or  n( a)  = n(b)  = 2 . Failure. 

Case  B2.  n(a)  = 1 , n(b)  = 2 . Set  cp’  = Q^b^  ...  a^b^cd 

where  S(a)  = {c]  and  S(b)  = fc, d]  . 

Case  B3.  n(a)  = 0 , n(b)  = 2 . Set  cp’  = aa^b^  ...  a^b^cd) 
where  S(b)  = {c,d} 

Case  Bl.  n(a)  = 1 , n(b)  = 1 , S(a)  = S(b)  . Set  cp'  = aambm  . . . a-^c 

where  S(a)  = {c}  . 

Case  B5.  n(a)  = 1 , n(b)  = 1 , S(a)  ^ S(b)  . Set 

<P’  = a<ambm>  •••  where  S(a)  = {c}  , S(b)  = {d}  . 

Case  B 6.  n(a)  = 0 , n(b)  = 1 . Set  cp’  = aa^^  ...  a-jb-^c  where 
S(a)  = [c]  . 

Case  B7 . n(a)  = 0 , n(b)  = 0 . Success. 

Again  the  proofs  in  each  case  axe  trivial;  we  shall  discuss  only  case  B6 

here:  Any  completion  of  cp  must  be  of  the  forms  aambm  . . . a-^b^xcuo 

(where  x is  a vertex  or  a blank),  aa  b ...  a,b.,c<jo  , or 
' m m 1 1 

ab  a . ..b.,a,  cw  . The  first  of  these  is  an  extension  of  cp'  ; and  the 
m m 1 1 t j 

second  or  third  imply  that  aatabm  . . . a^b^_ccu  is  also  a complete 
extension. 


Recall  that  tasks  of  Type  C are  specified,  by  the  string  q>  = a a^. . . a^  , 
for  some  m > 1 , where  at  most  a-^  is  active  and  a contains  no  usable 
blanks.  This  type  of  partial  layout  allows  considerably  more  flexibility 
than  Types  A and  B do,  since  it  may  be  possible  to  make  good  use  of  the 
m blanks.  Let  us  write  a as  a shorthand  for  a^  . Furthermore  we 
shall  write  U'  = UUS(a)  , with  S' (u)  and  n' (u)  defined  correspondingly. 

Case  Cl,  n(a)  > 3 • Failure. 

Case  C2,  S(a)  = [b,  c,  d]  . 

In  this  case  the  final  neighborhood  of  a in  a complete  extension  must  be 
bacd  , bade  , cabd  , cadb  , dabc  , or  dacb  ; the  possibilities  can  be 
narrowed  down  by  considering  various  subcases.  Symmetry  between  b , c , d 
is  used  in  order  to  reduce  the  number  of  possibilities;  in  other  words,  there 
is  always  a way  to  rename  the  elements  of  3(a)  so  that  some  subcase 
applies.  We  shall  say  that  a vertex  u in  S(a)  is  feasible  if  it  can 
conceivably  fit  to  the  left  of  a^  ; thus  u is  feasible  if  S' (u)  = [v] 
where  £(v)  < m , or  if  n' (u)  = 0 . In  the  former  case  we  say  that  u 
is  f(v)  -feasible;  in  the  latter  case  we  say  that  u is  O-feasible. 


Case  C2.1,  b — c , b — d , c — d . Failure. 

Case  C2.2,  b-4-c,b  — d,c  — d. 

In  this  case  we  must  decide  between  bade  and  cadb  . 

Case  C2.2.1,  neither  b nor  c is  feasible.  Failure. 

Case  C2.2.2,  b is  feasible  but  not  c . Set  cp'  = o:[ba]dc  . 

Here  and  in  the  sequel  we  shall  use  the  following  notation: 

[ba]  = _ato. . •_ajcf2bkak+l*  • ,boal  = b0  is  k- feasible  and 

b1  — ...  — b^  is  the  corresponding  chain  of  length  k . In  other  words, 
[ba]  stands  for  the  string  am“*  aq  b and-  ibs  successors 

inserted  into  the  appropriate  blank  spaces. 

Case  C2.2.3>  b is  k-feasible  and  c is  f-feasible 
where  k > l . Set  cp’  = ctfbaldc  . 


To  justify  this  step,  we  shall  prove  that 
ct[ba]dc  > a[ca]db  , 

where  we  say  that  partial  layout  dominates  rp0  (written  cp^  > <P2  ) 

if  every  completion  of  cp^  implies  the  existence  of  a completion  of  cp^  . 
In  our  case  any  chain-stretched  completion  of  cp  which  is  not  an 
extension  of  cp'  must  be  an  extension  of  a[ca]db  , so  it  must  have 
the  form  cp"  = a[ca]d0b0d^b^  ...  d^b^ou  . Let  cQ  — c^  — ...  — c^ 

be  the  chain  adjacent  to  c = c^  and  let  Cj  he  blank  if  i < j < k . 
Then  we  may  interchange  cQ,  ...,ck  with  bQ,  ...,bk  in  cp"  , obtaining 
a valid  completion  of  cp  which  extends  cp  ’ . 

It  is  important  that  the  reader  understand  the  justification  of 
step  C2.2.3  at  this  point  before  proceeding  further.  Although  the 
argument  is  very  simple,  we  shall  be  using  it  repeatedly  in  the  sequel, 
with  various  refinements  and  extensions  as  the  cases  get  more  complex. 

Case  C2.3,  b — c , b -4-  d , c-)-d. 

In  this  case  we  must  decide  between  bacd  , cabd  , dabc  , and  dacb  . 

Case  C2.3.1,  neither  b nor  c is  feasible.  Failure,  unless 
d is  feasible.  In  the  latter  case,  set 
cp ' = a[da] (be)  . 

Case  C2.3.2,  b is  feasible  but  not  c ; say  b is  k- feasible. 

If  d is  ^-feasible  where  l > k , set  cp'  = ct[da]bc  , 
otherwise  set  cp'  = a[ba]cd  . 

To  justify  this  step,  note  that  a[ba]cd  is  forced  unless  d is  feasible. 
In  the  latter  case  a[da]cb  cannot  be  better  than  a[da]bc  , since 
b = bfl  must  be  followed  by  b^, . . . , bk  , with  following  two 

positions  after  b^  ; it  is  easy  to  see  that  any  completion  of  a[da]cb 
can  be  converted  into  one  which  extends  a[da]bc  . Thus  we  must  simply 
distinguish  between  bacd  and  dabc  , and  the  argument  is  similar  to 
Case  C2.2.3. 
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Case  C2.3.3,  b is  k-feasible  and.  c is  ^-feasible,  where 

k > t . Set  cp'  = a[ba]cd  . 

The  argument  is  like  Case  2.2.3  again;  if  d is  feasible  too,  we  will 
soon  be  successful,  regardless  of  which  alternative  is  chosen. 

Case  C2.4,  b -f  c , b -)-  4 , c d . 

All  six  possibilities  of  Case  C2  still  remain,  but  we  can  make  use  of 
the  symmetry. 

Case  C2.4.1,  none  of  b , c , d is  feasible.  Failure. 

Case  C2.4.2,  b is  feasible  but  c and  d are  not.  Set 

cp'  = a[ba]  (cd)  . 

Case  C2.1.3,  b is  k-feasible  and  c is  f -feasible,  where  i < k , 

but  d is  infeasible.  Set  cp'  = a[ba]cd  . 

In  this  case  a[ba]cd  > cn[ba]dc  and  a[ca]bd  > a[ca]db  as  in  Case  C2.3.2, 
while  a[ba]cd  > a[ca]bd  as  in  Case  2.2.3* 

Case  C2.11-.4,  all  of  b , c , d are  feasible.  Set  cp'  = a[ba]cd  . 
Success  is  imminent. 

Case  C3,  S(a)  = {b, c}  . See  Section  5. 

This  is  by  far  the  hardest  case  to  handle,  and  we  shall  postpone  it  for 
a moment  since  the  remaining  cases  are  very  simple. 

Case  C4,  S(a)  = {b}  . Set  cp'  = a_an< . . _a1_b  . 

This  clearly  dominates  a_an« . . and  a_an* . < _a-|_b  . 

Case  C5,  n(a)  = 0 . Success. 
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5 . The  Subalgorithm  for  Type  C,  Case  C3. 

Now  we  must  face  up  to  Case  C3;  as  above  we  have  cp  = a a^. . . 
and  a = a^  and  S(a)=(b,c}.  We  should  replace  the  substring  a at 
the  right  of  cp  by  either  abc  , acb  , bac  , ba  c , cab  , or  ca  b , 
where  the  dashes  may  or  may  not  get  filled  in  later.  Fortunately  we  can 
rule  out  two  of  these  possibilities  immediately,  since  bac  is  never  better 
than  abc  and  cab  is  (similarly)  never  better  than  acb  : The  complete 
layout  a[ba]cuo  which  extends  bac  can  always  be  converted  to  a complete 
layout  a[b^a]bciu  which  extends  abc  . 

Case  C3.1>  b — c . 

In  this  case  we  have  to  distinguish  between  abc  and  acb  . Let  us  say 
that  b is  k-lucky  if  S' (b)  contains  a vertex  b^  with  f(b^)  = k and 
k < m . (if  there  are  two  or  more  such  vertices  b^  , choose  one  with 
maximum  k . ) Similarly  c might  be  lucky;  we  can  use  the  blanks  left 
of  a for  one  of  the  successors  of  a lucky  vertex. 

Case  C3.1.1,  neither  b nor  c is  lucky.  Set  cp'  = a a . ..  a^(bc)  . 

Case  C3.1.2,  b is  k-lucky  and  c is  either  (i)  unlucky  or 

(ii)  f-lucky  where  t < k , or  (iii)  k-lucky  and 
n' (b)  < n' (c)  . Set  cp'  =a[b,a]bc  . 

To  justify  this  step,  we  first  argue  (as  in  Case  C2.2.3)  that  the  layout 
a_am. . . _a^bcb-|  has  no  advantage  over  cp'  . Therefore  the  only  competing 
possibility  is  a_am* ..  a^cb  . By  considering  the  two  ways  to  place  b^ 
in  the  latter  string,  we  have  two  possible  types  of  completion  to  consider, 
say  cp"  = a[c1a]cbx1b1. . .x^b^u)  and  cp"'  = a[c^a]cbb-Lx1. . .x^-jb^u)  , since 

b,  has  degree  < 2 and  is  part  of  a stretched  chain.  (Here  c,  is  blank 

x . 

if  c is  unlucky  or  if  we  do  not  choose  to  make  use  of  c 's  luckiness.) 

We  can  always  replace  cp"  by  a[b^a]bcx^c-^. . .x^c^uo  , an  extension  of  cp'  ; 
similarly,  <p"  ' can  always  be  replaced  by  afb^ajbcx^c^. . .x^^c^  ^uj 

unless  c is  k-lucky.  But  in  the  latter  case  we  have  n' (b)  < n'(c)  = 1 
by  hypothesis,  so  the  are  all  blank  and  ou  is  empty;  cp'”  can 

therefore  be  replaced  by  a[b1a]bc_c-L. . . _c^  . 
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Case  CJ.2,  b -J-  c and  n'(b)  > 3 . Failure. 

Case  C3.3,  b -I-  c and  n'(b)  = 3 • If  S'(b)nS'(c)  = {d}  and 

either  S'(c)  ^ {d}  or  S'  (c)  = {c^d}  where  f(c^)  < m , 

set  cp'  = a[ca]db  . If  S'(b)  OS'(c)  = 0 and  either 

S'(c)  =0  or  S'(c)  a {c^}  where  f(c1)  < m , set 
cp'  = a[ca]  b . Otherwise  failure. 

Case  C3.4,  b -+-  c and  max(n' (b),n' (c) ) = 2 . 

Case  C3.4.1,  S'(b)  = S'(c)  . Failure. 

Case  C3.4.2,  S'(b)nS'(c)  = {d}  . If  n'(b)  = 2 , let 

S'(b)  = (b^,d]  ; if  n'(c)  = 2 let  S' (c)  = [c^,dj  . 

In  this  case  we  say  that  b is  k-lucky  if  f(b^)  = k and  k < m ; 
b is  O-lucky  if  n'(b)  = 1 ; otherwise  b is  unlucky.  Similarly  c can 
be  lucky  or  unlucky.  There  are  four  viable  alternatives  to  decide  between, 
namely  a[ba]dc  , a[b^a]bcd  , a[ca]db  , and  a[c^a]cbd  . 

Case  C3.4.2.1,  neither  b nor  c is  lucky.  Failure. 

Case  C3.4.2.2,  b is  k-lucky  and  c is  unlucky.  If  k = m , set 
cp'  = a[b^a]bcd  . Otherwise  set 

<p'  = a-V*--V2<bkVi>'--<bai><dc)  • 

This  is  the  neatest  part  of  the  entire  algorithm,  since  the  two  viable 
alternatives  a[ba]dc  and  a^ajbcd  turn  out  to  be  essentially  a 
Type  B situation.  (On  the  other  hand  it  may  also  be  considered  the 
sloppiest  part  of  the  algorithm,  since  an  abuse  of  notation  is  involved 
here:  If  the  Type  B specification  is  ultimately  completed  to  a string 

of  the  form  a a . ..  ajt+2ak+ltV  • ‘a-^bcdoj  , a blank  should  actually  be 
inserted  just  before  • ) 

Case  C3. 4.2.3,  13  is  k-lucky  and  c is  f-lucky,  where  k > l . 

Get  cp'  = Otfb^ajbcd  . 

It  is  easy  to  check  that  cp'  dominates  the  other  three  alternatives, 
using  arguments  like  those  in  Section  4. 
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Case  C3.4.3. 


S'(b)nS' (c)  = 0 and  n'(b)  = n'(c)  = 2 . Let 
S'  (b)  = {b-^jbjJ  and  S' (c)  = cp  , where 

i(bx)  < i(b-j_)  and  i(cp  < f(cp  . 

The  only  possibilities  are  a[ba]bpc  c£  and  a[ca]cjbbp.{  , perhaps 
interchanging  b^  with  b|  and/or  with  . 

Case  C3. 4.3.1,  f(bp  > m . If  £(cp  > m , failure;  otherwise  if 
£(cp  > m , set  q> ' = a[ca]cp  ; otherwise  set 
cp ' = a[c'a]c1b  , where  " [c’a]  " means  that  the 
blanks  are  to  be  filled  by  c and  the  chain 
containing  p . 

These  actions  are  forced  unless  f(cp  = f(cp  = 1 , for  if  c^  and  cJ{ 
both  have  finite  level  we  must  have  i(c^)  = 1 or  failure  will  be  imminent. 

Case  C3.4.3.2,  f(b^)  < m < f(bp  , Hcj_)  < m < f(cp  , and 

n'  (b|)  < n' (cp  . If  S' (bp  / [cp  , failure; 
otherwise  set  cp'  = a[ba]bp  . 

In  this  case  it  is  impossible  to  complete  cp  with  a[ba]bpc^c|  , since 
f(bp  > 1 ; the  only  viable  alternatives  are  a[ba]bpcp^  and  ct[ca]cpb.'b^ 
and  we  must  have  b^  — c£  . Now  if  S'(cp  / {bp  , the  stated  value  of  cp’ 
is  forced,  otherwise  success  is  imminent. 

Case  C3.4.3.3,  £(bp  <m  < l(bp  and  i(cp  111  • Set  cp'  = a|c'a]c  t> 

This  is  essentially  forced,  since  ct[ba]bp (c^cp  implies  f(bp  = 1 
when  c^  and  c.{  have  finite  level. 

Case  C3.4.3.4,  f(bp  < m , f(cp  < m , and  f(b^)  _£  l(C]_)  • Set 
cp'  = a[b'  a]b^c  . 

As  in  Case  C3. 4.3.1  we  see  that  failure  will  occur  unless  f(bp  = 1 . 

Case  C3.4.4.  S'  (b)  fl  S'  (c)  = 0 , n'  (b)  = 2 , and  n'  (c)  < 1 . Let 
S'  (b)  = (b1,bp  , where  f(bp  < f(bp  j and  if 
n'(c)  = 1 , let  S'(c)  = {cp  , otherwise  let  c^ 
be  blank  and  l(c^)  = 0 . 
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There  are  many  possible  arrangements  to  choose  from,  and  the  subcases 
require  careful  analysis. 


Case  CJ.k.k.l,  {( b^)  > m . If  /(c^)  > m , set  <p’  = a am»«»  agcac^b  , 
If  f(c^)  = m , set  cp ’ = a[c^a]cb  . Otherwise  set 
cp ' = a[ca]  b . 

Case  C$.k.h.2,  l(b^)  <m  , £(c^)  < m . If  ^(c^)  = m or  f(b£)  < ® , 

set  cp'  = a[c^a]cb  . Otherwise  if  £(c^)  < f(b^)-2  , 
set  cp'  = afb-^ajbc  ; otherwise  set  cp'  = a[ea]b.jb  . 

If  £(b£)  < co  , success  is  imminent,  so  we  may  assume  that  i(b^)  = ® . 

Then  a[b,a]bcb'  > a[ba]b'c  ; and  a[ca]  b >a[c1a]cb  > a a ...  a cac^b  , 
unless  f(c1)  = m when  a[ca]_b  is  inapplicable.  If  £(0^)  = m , it  is 
clear  that  a[c^ajcb  > ttfb^albcb^  ; otherwise  we  need  to  compare 
a[b-^a]bcb^  with  a[ca]  b , and  the  best  place  for  b^  in  the  latter 
string  is  afcajb^b  b^  . The  stretched  chains  in  these  two  alternatives 
now  fill  respectively  l(c^)  and  f(b1)-2  positions  to  the  right  of  b^  , 
and  it  is  best  to  minimize  this  quantity. 


Case  CJ.k.h.J,  £(b^)  < m and  f ( c_^ ) > m . If  £(b^)  = m , set 

cp’  = a[b^a]bcb|c^  . Otherwise  if  £(b£)  = m , set 

cp'  = a[b-^albcb^c^  . Otherwise  if  f(b^)  < m , set 

cp'  = atb'ajb-^c  . Otherwise  let  k = £(b^)  ; set 

?'  = a_V,*-aTc+2^kak+l^,**(bla2^bal^biC^  * 

As  in  Case  C3.^.2.2,  this  is  a slight  abuse  of  notation. 


Case  C5-^.5,  S'(b)DS'(c)  = ft  , max(n' (b),n' (c))  = 1 . If 

n'(b)  = 1 , let  S'(b)  = {b^}  ; otherwise  let  b^ 
be  blank  and  set  ^(b^)  = 0 . Define  c^  similarly. 

Case  C3.^.5.1,  £(b^)  < m and  f(c^)  < m . Set  cp'  = qpbc  . 

Success  is  imminent. 


Case  C3.1+.5.2,  £(b^)  < m and  f(c^)  > m . If  £(b^)  = m , set 

qp ' = a[b^a]bc  , otherwise  set  cp'  = u[ba]  c . 
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Case  C5.U.5.3 > /(b^)  >m  311(1  *(cp)  > m • 

In  this  final  case  we  must  "look  ahead"  before  deciding  what  to  do. 

For  k > 1 if  b^  has  degree  2 , let  bk+1  be  the 
vertex  adjacent  to  bk  which  has  not  yet  been  given 
a name;  continue  until  having  found  the  sequence 
b — — ...  — bk  where  deg(bR)  ^ 2 . Similarly, 

find  the  sequence  c — c^  — ...  — where 
deg(cf)  / 2 . 

(This  process  must  terminate,  since  G is  not  a cycle. ) 

Case  C5.h.5.}.l,  b^  = = a or  deg(bk)  = deg(c^)  = 1 . Get 

p'  = pbc  . 

Success  is  imminent. 

Case  C3.k.5.3«2,  deg(b  ) = 1 and  deg(c  ) > 2 . Set 

K x 

p'  = a am*  • • agbab^c  . 

Case  C3.1+.5.3.3>  deg(bk)  > 2 , deg(c{)  > 2 , and  k < £ . 

In  this  case  we  must  decide  between  four  alternatives  abcb^c^. . .bk_^ck_jt>k  , 
acbc-^b^. . .ck  ^bk  -^ck  , bab^cb^c^. . ,bkck  ^ , and  cac^bc^b^. . .ckbk_^  ; 
by  acquiring  a little  more  information  about  bk  , c^  , k , and  £ it  will 
become  clear  which  of'  these  dominates: 

Case  C3.t.5.3.3.1>  bk  = c . If  k = £ , set  p'  = pbcb^  ; 

otherwise  set  p'  = a a ...  a^ca..e-1b  . 

Case  Q3. h.rj. 3. 3. 2,  bR  — c . If  k = £ , set  p'  = p<bc)(h1c1>  ; 

otherwise  set  p'  = a a . . . _a0(ac)(bc1)  . 

Case  03 . 4.5 .3 *3 *3>  bk  ^ C£  ’ bk  4-  cf  . Failure. 

Note  that  the  "lookahead  time"  required  to  find  k and  £ in  Case  C^.h. 3. 3 
is  0(k+£)  , not  0(1)  ; but  Case  03. ^.5*3  cannot  occur  again  until 
bf,  ...,bk  1 have  811  been  included  in  the  string  p . Thus 

the  lookahead  time  can  be  distributed  among  the  subsequent  steps,  and  the 
subalgorithm  runs  in  linear  time. 

We  have  now  exhausted  all  possible  cases,  and  the  subalgorithm  is  complete. 
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Examples. 

Here  is  how  the  subalgorithm  would  proceed  to  search  for  a layout 
for  the  graph  of  Figure  1,  beginning  with  DC  : 


Case 

A3 

B2 

A3 

Bl 
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DC 

DC(AE) 

DCAEBF 

DCAEBF(GJ) 

Failure. 


On  the  other  hand,  if  we  begin  with  DA  , the  algorithm  succeeds: 


A2 

A2 

A2 

A4 

C3.1+.4.l(i) 

A2 

A2 

Ah 

C3.4.4.l(ii) 

A3 

B5 

b6 

Cl 

C2.3.l(ii) 

B2 

A5 


DA 

DAC 

DACE 

DACBE 

DAC BE  F 

DACBEGFHJ 

DACBEGFHJ1 

DACBEGFHJIK 

DACBEGFHJ IK  L 

DACBEGFHJ1KNIM0 

DACBEGFHJ IKNLMO(PR) 

DACBEGFHJ  IKNLMO  ( PR  ) < QS> 

DACBEGFHJ IKNLMOFRQS  T 

DACBEGFHJ IKNLMOPRQS’T  U 

DACBEGFHJ  IKMMOPRQSWTVU(XY) 

DACBEGFHJIKNLMOPRQSWTVUYX&Z 

Success. 


Here  is  how  the  algorithm  would  construct  the  same  solution 
"backwards",  starting  with  Z&  : 


A2 
A2 
A 2 
A3 
B5 
B 6 
CU 

C3.4.4.2(iii) 

A2 

A2 

AU 

C3.4.4.l(i ) 

A 2 
A2 
A3 
B2 
A5 


2& 

Z&X 

Z&XY 

Z&XYU 

Z&XYU<TV) 

Z&XYU(TV)<SW) 

Z&XYUVTWS  R 

ZfeXYUVTWS"R  0 

Z&XYUVTWoQPPOML 

Z&XYUVTWSQRPOMLN 

Z&XYUVTWSQRPOMLNK 

^feXYUVTWSQRPOMLNK  J 

ZAXYUVTWSQPPOMLNKIJHF 

Z^YUVTWSQRPOMLNKIJHFG 

ZdrXYUVTWSQRPOMLNKIJHFGE 

ZAXiWTWSQRFOMLNKI  JHFGE(BC  ) 

Z&XYUVTWS  QRPOMLNKI JHFGEBC  AD 

Success. 


N I,»l-  - 


If  the  algorithm  had  chosen  the  somewhat  tempting  alternative 
Z&XYUVTWSNBMOLPK  at  step  C3 . 4 . 4 . 2 in  this  example,  failure  would  have 
followed  soon  after. 

Suppose  Figure  1 were  changed  so  that  F J became  F * 

Then  the  algorithm  would  invoke  further  cases: 

Z&XYUVTWSQRPOMLNK  J 
Z^UVTWSQRPOMLNKIJH* 

Z^YUVTWSQFPOMLNKIJH*G 
Z&XYUVTWS QPPOMLNKI  JH*GF 
Z&XYUVTWSQPPOMLNKIJH*GF  E 
Z&XYUVTW  S QRPOMLN  KI JH* GFDECBA 
Success. 


C3. 4.5.3. 

A2 

A2 

Ah 

C3.4.2.3 

A5 
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7 . Applications  of  the  Subalgorithm. 

The  sub  algorithm  determines  in  0(n)  steps  whether  or  not  G has 
a bandwidth-2  layout  beginning  with  ab  ; by  trying  all  possible  a and 
b we  have  an  0(n^ ) algorithm  for  deciding  whether  or  not  Bandwidth (G)  < 2 . 
This  can  be  improved  to  an  0(n  ) algorithm,  by  using  the  subalgorithm 
to  decide  whether  or  not  G has  a complete  layout  that  extends  xy_ a , for 
some  vertex  a and  some  (nonexistent)  dummy  vertices  x and  y . However,  we 
really  want  an  0(n)  algorithm,  so  it  is  necessary  to  be  a little  more  careful. 

We  observed  at  the  beginning  of  Section  2 that  G may  be  assumed 
to  contain  a vertex  v of  degree  > 3 j suppose  v — a , v — b , and 
v — c . Then  any  layout  for  G must  contain  one  of  the  six  substrings 

vab  , vba  , vac  , vca  , vbc  , vcb  , 

or  their  left-right  reflections,  since  two  of  [a,b, c]  must  appear  on  the 
same  side  of  v . To  test  Bandwidth(G)  <2  in  linear  time,  it  therefore 
suffices  to  have  a linear-time  algorithm  that  determines  whether  or  not  a 
complete  layout  exists  containing  a given  substring  of  three  vertices. 

(Recall  that  a "complete  layout”  always  has  bandwidth  2 according  to 
the  definition  in  Section  2.) 

Let  us  first  develop  an  algorithm  which  decides  in  0(n)  steps 
whether  or  not  there  is  a complete  layout  for  a given  connected  graph  G , 
containing  a given  substring  abed  of  length  L : 

Step  1.  Stop  with  failure  if  a — d . 

Step  2.  Let  GQ  be  the  graph  obtained  from  G by  deleting  all 

edges  among  [a,b,  c,  d}  . If  there  is  a path  in  Gp  from 
a or  b to  c or  d , stop  with  failure.  (This  path 
cannot  possibly  be  incorporated  into  a complete  layout 
containing  abed  , since  it  cannot  get  to  the  right  of  b . ) 

Step  3.  Let  the  vertices  of  V\{a,b,c,  d]  be  partitioned  into  two 
subsets 

= {v  | a path  exists  in  GQ  from  v to  a or  b]  , 

V2  = {v  | a path  exists  in  GQ  from  v to  c or  d]  . 

(By  step  2,  and  are  disjoint.  Furthermore 

V=  {a,  b,  c,  d}  U V-L  U V0  > since  G was  connected.)  Let 
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be  Gq  restricted  to  V^U^ajb}  , and  let  Gq 
be  Gq  restricted  to  VpU{c,  d]  . Use  the  subalgorithm 
to  find  a layout  cp1  for  Gx  beginning  with  ba  , 
and  also  to  find  a layout  tp  for  Gq  beginning  with  cd 

If  either  attempt  fails,  stop  with  failure;  otherwise 
stop  with  success,  since  is  a complete  layout 

for  G as  required.  □ 

Now  to  solve  the  similar  problem  given  a substring  abc  of  length  5 
we  consider  two  cases: 

(i)  There  is  at  least  one  vertex  d ^ a,  c such  that  b — d . Then 
the  complete  layout  must  contain  either  abed  or  aabc  , and  we 
use  the  previous  algorithm  to  try  both  cases. 

(ii)  There  is  no  vertex  d ^ a, c such  that  b — d . Then  we  can  use 
an  algorithm  analogous  to  the  one  above:  Let  Gq  bo  G minus 
all  edges  among  {a,b,c}  and  stop  if  there  is  a path  from  a 

to  c in  Gq  . Otherwise  partition  V\{a,b,cJ  into  disjoint 
sets  V-j^  and  Vq  , where  contains  the  vertices  reachable 

from  a and  Vq  those  reachable  from  c . Any  complete  layout 
containing  the  substring  abc  must  be  composed  of  a complete 
layout  for  G^  ending  with  ab  and  a complete  layout  for 
beginning  with  be  . 

It  is  also  possible  to  construct  a linear-time  algorithm  that 
decides  whether  or  not  a complete  layout  exists  containing  a given 
substring  ab  of  length  2 ; details  are  left  to  the  reader. 
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8.  Tree  Bandwidth  is  NP-complete. 

In  this  section  we  shall  prove  that  the  general  problem  of  determining 
the  bandwidth  of  a graph  is  NP-complete;  that  is,  any  problem  in  the  large 
class  NP  can  be  transformed  into  the  problem  of  determining  whether  or 
not  the  bandwidth  of  some  graph  is  less  than  some  integer  k , with  at  most 
a polynomial  increase  in  the  size  of  the  problem  specification.  (See  [25] 
and  [2,  Chapter  10]  for  surveys  of  NP-complete  problems.)  This  particular 
result  was  first  obtained  by  C.  H.  Papadimitriou  [28];  we  shall  prove  it 
in  a sharper  form,  by  severely  restricting  the  form  of  G . 

Theorem.  The  following  problem  is  NP-complete:  Given  an  integer  k , and 

given  a graph  G which  is  a free  tree  with  no  vertices  of  degree  > 5 , is 
Bandwidth(G)  < k ? 

Proof.  The  problem  of  determining  whether  or  not  Bandwidth (G ) k , given 
k find  an  arbitrary  graph  G , is  clearly  in  NP  . We  shall  complete  the 
proof  by  showing  that  the  "3-partition  problem,  " which  is  known  to  be 
NP-complete  [16,  p.  120],  can  be  polynomially  transformed  into  the  restricted 
bandwidth  problem  stated  in  the  theorem. 

Given  a sequence  of  3n  integers  (a^,  a^,  . . .,  a^)  , where 

al  + a2  + ' * ' + = nA  3/113  A/**  < a^  < A/2  for  each  i , the  3-partition 

problem  asks  whether  or  not  there  is  a way  to  partition  the  integers 
fl,2,  ...,3n]  into  disjoint  triples  T-^,  ...,Tn  so  that  D [a^  | j e T^}  = A 

for  1 < i < n . In  other  words  it  is  a special  bin-packing  problem, 
where  we  are  to  take  3n  objects  of  integer  sizes  a^,  a^, . . .,  and 

pack  them  into  n boxes  of  size  A whenever  possible.  The  condition 
A/U  < a^  < A/2  means  that  each  box  in  any  such  packing  must  contain 
exactly  three  objects. 

Given  the  specification  of  a 3~Partition  problem,  our  job  is  to  construct 
an  integer  k and  a free  tree  G whose  vertices  all  have  degree  < 3 , 
such  that  there  is  a 3-partition  if  and  only  if  Bandwidth (G)  < k . From 
the  proof  in  [14]  it  suffices  to  do  this  with  a tree  whose  size  is  at  most 
a polynomial  in  n and  A , since  the  3~partition  problem  is  N1 -complete 
even  when  the  magnitudes  of  all  3 n numbers  are  bounded  above  by  a 
(suitably  large)  polynomial  function  of  n . (See  [15]  for  a discussion 

of  this  "strong  NP-completeness"  property.) 
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The  free  trees  we  shall  construct  bear  more  resemblance  to  pelagi c 
hydrozoa  of  the  order  Siphonophora  than  to  actual  trees,  so  we  shall  find 
it  convenient  to  use  terms  from  marine  biology  rather  than  botany.  Our 
construction  involves  parameters  m-^,  ...,m^n  , d , and  k which  we  shall 
specify  later  after  the  properties  we  need  for  the  proof  have  been 
explained. 

The  graphs  of  interest  to  use  all  have  the  general  structure  shown  in 
Figure  3.  There  is  a long  stem,  a path  in  which  every  d-th  vertex  has  a 
special  name;  the  respective  names  of  these  special  stem  vertices  are 

b0  hl  bl  P1  b2  fl  b3  p2  bl*  f2  * " pn  b2n  fn  b2n+l  h2  b2n+2  ^ b2n+3 

from  left  to  right.  It  follows  that  the  stem  contains  4dn+6d+l  vertices 
in  all.  There  are  also  3n  long  tentacles  attached  to  special  vertices 
tp  ...,t^n  ; the  i-th  tentacle  consists  of  a long  fi lament  followed  by  2iru 
nematocysts  as  shown  in  Figure  4.  If  we  break  off  each  tentacle  just 
below  the  node  t^  , and  if  we  remove  the  boundary  nodes  bQ, b^, . . ., b^n+^  , 
the  remaining  graph  consists  of  2n+3  connected  pieces  called  polyps, 
named  respectively 

H1  P1  F1  P2  F2  Pn  Fn  H2  «3 

from  left  to  right.  Note  that  the  vertices  t^,  . ..,t^n  all  belong  to  the 
polyp  called  , the  animal's  "second  head". 


4dn  vertices 


t.  filament 

1 


2m.  vertices 

'2222/2 

nematocysts 


Figure  4.  General  form  of  the  i-th  tentacle. 


25 


r 

I 


f 


t 


[ j 

fV 


We  have  noted  that  the  special  vertices  b ,h, , b, , . . .,b„  ,,  are 

O'  1’  1 2n+3 

separated  by  distance  d ; our  construction  will  also  have  the  property 
that  every  node  of  a polyp  , or  is  distance  < d from  its 

"central"  node  In  , p^  , or  ft  . 

Now  we  shall  impose  further  constraints  on  the  construction,  so  that 
it  will  not  be  easy  to  make  layouts  of  bandwidth  k . In  the  first  place, 
we  will  require  each  of  the  heads  to  contain  exactly  2dk-l  vertices. 

This  means  that  there  are  exactly  2dk  vertices  ^ in  at  distance  < d 
from  h^  (since  each  head  touches  two  boundary  nodes  b,  ),  so  it  is 
necessary  to  lay  these  vertices  out  in  such  a way  that  the  dk  nearest 
locations  on  each  side  of  h.  are  occupied  by  precisely  those  elements 
at  distance  d or  less  in  the  graph.  In  particular,  consider  the  layout 
of  , and  assume  without  loss  of  generality  that  vertex  b^  occurs  to 
the  right  of  h-^  ; then  all  of  the  other  polyps  must  appear  to  the  right 
of  in  the  layout,  since  there  is  no  way  for  any  of  their  vertices  to 

get  to  the  left  of  h^  without  making  the  bandwidth  > k . A similar 
argument  applies  to  the  third  head  , which  therefore  must  appear 
(together  somehow  with  b0n+^  ) at  the  extreme  right  of  the  layout.  All 
of  the  other  polyps,  and  all  of  the  tentacles,  must  appear  between 
and  . 

We  shall  arrange  things  so  that  the  total  number  of  vertices  in  the 

graph  is  exactly  (2n+3 ) (2dk)+l  . This  means  that  the  situation  will  be 

very  "tight":  There  are  (2n+l) (2dk)-l  vertices  which  must  appear  in  the 

layout  between  b^  and  b0n^  > but  vertices  b^  anu  b,,^^  are  at  dist.-ince 

(2n+l)(2d)  from  each  other  in  the  graph,  so  we  must  conclude  that  the  stem 

between  b-^  and  bgn+2  is  stretched  tightly.  In  other  words,  two  adjacent 

nodes  in  this  portion  of  the  stem  must  be  placed  k positions  apart. 

(it  does  not  follow  that  the  stem  from  b_  to  h,  or  from  h_  to  b^ 

' 0 1 3 2n+3 

is  stretched;  bQ  might  even  appear  to  the  right  of  h^  . But  all  we 
are  using  and  for  is  to  confine  the  other  nodes  and  therefore  to 

assign  a rigid  structure  to  the  interior  parts  of  the  layout.) 

Since  the  stem  is  stretched  tightly,  and  since  the  polyps  contain  no 
nodes  at  distance  > d from  their  central  node,  the  layout  must  now  appear 
as  a sequence  of  regions  which  we  may  represent  as  follows: 
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• 0 • 


H'  b,  P'  b„  F'  b,  P’  b 


‘1  ~1  ‘1 


1 u3  r2  “h  2 


PA  b2n  FA  b2n+l  b2n+2 


Here  is  a layout  of  U {bQ}  , is  a layout  of  , Hj  is 

a layout  of  H^U{b2n+_,]  find  (P^  , Fp  are  respectively  layouts  of 

(P^  , F^)  plus  portions  of  the  tentacles  which  just  manage  to  fit.  Each 
of  the  regions  Pj  , F^  includes  exactly  2dk-l  vertices  of  the  layout. 
The  reader  should  stop  at  this  point  to  review  the  construction  before 
going  on. 

If  we  choose  the  sizes  of  p\  , F^  carefully  it  will  be  difficult 


to  place 

the 

tentacles. 

Let  us 

say  that 

F. 

l 

contains 

exactly 

2dk  - 1 - 6di  vertices. 

P. 

i 

contains 

exactly 

2dk  - 1 - c - l8di  + 12d  vertices, 

so  that 

F! 

l 

contains 

exactly 

6di  tentacle  vertices, 

and 

p: 

X 

contains 

exactly 

c + l8di  - 12d  tentacle  vertices. 

where  c is  a constant  to  be  determined  later.  Note  that  the  tentacles 
are  all  connected  to  > so  they  have  to  emanate  from  near  the  right  end 
of  the  layout,  passing  through  Fj  before  coming  to  F^  . If  Pj^, . . . , p^ 
together  contain  portions  of  at  least  r^  different  tentacles  then  F^ 
must  contain  at  least  2dr^  vertices  of  these  tentacles,  since  a path 
cannot  cross  F.^  without  using  up  at  least  2d  positions;  hence  2dr^.  < 6di 
i.e., 

ri  < 3i  . 

Furthermore  if  r = 3i  each  tentacle  must  use  exactly  2d  positions  of 
F.!  , so  there  can  be  no  nematocysts  in  F!^  in  this  case. 

By  choosing  the  values  of  c,m^, ...,m^n  we  will  be  able  to  guarantee 
that  exactly  3i  tentacles  come  through  F|  . Consider  first  P^  , which 
must  contain  c+  6d  tentacle  vertices;  these  must  come  from  at  most  three 
different  tentacles  because  of  the  constraint  on  r^  . If  we  choose  each 
nn  as  a function  of  the  given  numbers  a^  so  that  the  number  of  nodes  in 
two  tentacles  is  always  less  than  c + 6d  , then  P-^  must  contain  vertices 
from  exactly  three  different  tentacles,  and  it  must  include  all  of  their 
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Furthermore  we  will 


nematocysts  too  because  of  the  constraint  on  . 
be  able  to  argue  in  the  same  way  that  p^  must  now  include  all  the 
nematocysts  of  three  other  tentacles  because  of  the  constraint  on  Fi,  , 
and  so  on. 

In  order  to  make  this  argument  go  through  properly  we  will  want  to 

define  things  so  that  the  three  tentacles  whose  nematocysts  appear  in  P| 

have  their  filaments  "pulled  completely  through"  the  succeeding  regions, 

with  exactly  2d  vertices  of  their  filaments  appearing  in  each  of 

F!  , P!  P'  , F'  . It  turns  out  that  we  can  do  this  by  making  each 

1 l+l  n n 

irn  a multiple  of  6dn  , and  requiring  that  a^  + a^.  + a^  = A if  and  only 

if  2(m.  + m.  + m„ ) = c . Let  us  set 
v i J l 

m.  = 6dna.  , c = 12dnA  : 
ii  ’ 

we  shall  prove  that  a layout  of  bandwidth  k implies  the  existence  of  a 
3 -partition: 


Lemma.  For  1 < i < n , region  P|  contains  all  of  the  nematocysts  from 
exactly  three  tentacles,  namely  the  tentacles  connected  to  t . where  j 
is  in  some  triple  T.  , and  Ti  (a.  \ j e T. } = A . Furthermore  P!  also 
contains  as  much  as  possible  of  the  filaments  from  these  tentacles,  i.e., 
each  tentacle  in  1L  has  only  2d  vertices  in  each  of 


F'. 

l 


Pi+1 ' 


FA 


Proof.  By  induction  on  i , we  know  that  Fj  and  Pj  each  contain 
3(i-l)(2d)  filament  nodes  from  tentacles  whose  nematocysts  appear  in 
P|  ...  Pi  ^ . That  leaves  6d  empty  positions  in  F|  and  12dnA + 12di  - 6d 
in  p!  . Now  P^  must  contain  vertices  from  at  least  three  tentacles, 
since  two  tentacles  have  at  most  8dn+2(nm  + mp  *=  8dn  + 12dn(a^.  + a^) 

< 8dn + 12dn(A-l)  = 12dnA  - Udn  vertices  altogether. 

Hence  PI  has  vertices  from  exactly  three  tentacles,  defined  by  some 
triple  c [1,2, ...,3n}  , and  it  includes  all  of  their  nematocysts 
because  F!  has  room  for  only  6d  more  vertices  from  all  three  tentacles. 
Let  2 {a.  | j e T. } = a ; then  the  12dnA+  12di  - 6d  available  positions 
in  Pj  are  taken  up  by  12dna  nematocysts  and  somewhere  between  0 and 
3(Udn-  (2n  - 2i+l) (2d) ) = 12di  - 6d  filament  nodes.  It  follows  that  a = A 


# 


V 
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and.  exactly  12di  6d  filament  nodes  are  present.  □ 


The  lemma  proves  that  a bandwidth-k  layout  for  a graph  of  this  kind 
necessarily  leads  to  a valid  3~pa-rtition.  To  complete  the  proof 
of  the  theorem,  we  must  define  the  graphs  so  that  existence  of  a 
3-partition  is  sufficient  to  imply  the  existence  of  a layout  with  bandwidth 
k . This  means  in  particular  that  we  will  have  to  choose  d and  k 
appropriately.  Furthermore  the  graphs  must  be  constructible  by  an  algorithm 
whose  running  time  is  bounded  by  a polynomial  in  n and  A . 

In  the  first  place  we  want  to  choose  k large  enough  that 
contains  at  least  2d-l  vertices,  hence  we  require 

k > 6nA  + 9n  - 5 

For  convenience  we  let  k be  the  smallest  power  of  2 satisfying  this 
condition,  and  we  write 


Finally  we  choose 

d = fk  . 

From  these  parameters  k and  d we  can  construct  G by  explaining 
how  to  construct  each  polyp.  The  head  polyps  are  formed  by  the 

bandwidth-2*  layout  indicated  in  Figure  5 for  l = 3 (although  i will 
never  be  this  small).  A periodic  pattern  begins  to  repeat  after  the 
f-th  stem  node  to  the  right  of  in  : the  j-th  node  preceding  a stem  node 

branches  to  the  (2j)  -th  and  (2j+l)  -st  nodes  preceding  the  next  stem 

i- 1 

node,  for  0 < j < 2 . Before  this  pattern  is  established,  we  have 

(1,2,  4,. ..,2*'^)  the  respective  limits  on  j . An  additional  "thread 
branch"  goes  ou+  of  in  to  fill  up  the  remaining 

(2*-l)  + (2*-2)  + ...  + (2*  - 2*'1)  = fk-2J+l  = d - k+1  holes  near  the 
center.  To  the  left  of  in  we  use  essentially  the  same  idea  in  mirror 
image;  thus  it  is  clear  that  no  vertex  is  at  distance  greater  than  d from 
the  center  node.  The  special  nodes  t^, . . .,  t^n  in  H2  are  taken  to 
the  leftmost  Jn  nodes  in  its  layout. 


i 

i 


■ 
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thread  branch 


first  stem  node 


Figure  5 . Layout  of  a head  polyp  iL  in  the  immediate  vicinity 
of  its  center  node  h.  . 


A similar  procedure  is  used  to  construct  the  other  polyps  and 

. In  each  case  we  wish  to  remove  2dx  nodes  from  a full  head  polyp, 
for  some  integer  x , and  we  do  this  by  removing  x nodes  between  each 
pair  of  adjacent  stem  nodes.  The  x nodes  immediately  to  the  right  of 
each  stem  node  in  Figure  5 are  simply  deleted  from  the  graph,  together  with 
all  edges  touching  them,  and  the  "thread  branch"  is  reconnected  for  the 
remaining  nodes;  again  the  mirror  image  of  this  pattern  is  used  to  the  left  of 
the  center  vertex,  and  we  clearly  have  a tree.  It  is  easy  to  see  that  the 
resulting  polyp  has  a layout  of  length  2dk-l  in  which  the  x positions 
just  to  the  left  of  each  stem  node  are  empty.  (Simply  shift  all  non-stem 
vertices  which  lie  to  the  right  of  the  center  vertex  exactly  x places  to 
the  left.)  These  x slots  form  x parallel  "channels"  through  which 
filaments  can  pass. 

Now  it  is  not  difficult  to  see  how  to  embed  the  tentacles  into  these 
polypi  layouts  whenever  a 3-partition  is  given.  For  example,  we  can  place 
filaments  for  the  three  tentacles  specified  by  T^  into  the  rightmost 
three  channels  of  Fp P?,F2, . . ., P ,F  . Now  it  is  easy  to  make  the  remaining 
nematocyst  and  filament  nodes  fit  into  the  remaining  spaces  in  P,  without 


y.  Directed  Bandwidth. 


Analogous  problems  can  be  studied  when  G is  an  acyclic  directed 
graph,  where  we  require  its  layout  to  be  a topological  sorting  of  the 
vertices;  in  other  words,  we  stipulate  that  f(u)  < f(v)  whenever 
u - v in  the  graph,  and  we  ask  for  the  minimum  bandwidth  subject  to  this 
constraint. 

The  algorithm  in  Sections  2 through  7 above  can  readily  be  modified 
to  test  for  "directed  bandwidth  2 In  fact,  the  situation  becomes  so 
much  simpler  that  it  is  tempting  to  try  for  directed  bandwidth  3 in 
polynomial  time. 

The  NP-completeness  construction  in  Section  8 can  be  modified  in  a 
straightforward  way  to  obtain  an  analogous  result. 

Theorem.  The  following  problem  is  NP-complete:  Given  an  integer  k , 

and  given  a directed  graph  which  is  an  oriented  tree  having  no  vertices 
of  in-degree  > 2 , is  its  directed  bandwidth  < k ? 

(Each  vertex  of  an  oriented  tree  has  out-degree  < 1 , and  there  are  no 
cycles. ) 

The  analogous  problem  of  minimizing  £ (f(v)  - f(u))  over  all  topological 
sortings  of  a general  acyclic  directed  graph  has  recently  been  proved 
NP-complete  by  E.  L.  Lawler  [26];  on  the  other  hand  Adolphscn  and  Hu  [1] 
have  resolved  this  problem  in  polynomial  time  when  the  directed  graph  is  an 
oriented  tree,  even  when  the  arcs  have  been  assigned  arbitrary  weights. 

The  above  theorem  indicates  that  the  bandwidth  problem  is  somewhat  harder 
than  till s optimal  ordering  problem,  in  the  directed  us  well  as  the 
undirected  case. 


10.  Some  Open  Problems. 

The  following  related  questions  are  still  waiting  for  an  answer: 


(a)  Is  the  problem  "Bandwidth (G)  < 3 " NP-complete,  given  an  arbitrary 
graph  (or  perhaps  a tree)  G ? 

(b)  Is  there  a polynomial  time  algorithm  to  enumerate  the  number  of 
distinct  bandwidth-2  layouts  of  a given  graph  G ? 

(c)  For  which  exponents  m is  the  problem  "Some  layout  of  G satisfies 
2D  (j"(u)  - f(v)|n  : u — v in  G]  < k " NP-complete,  when  G is  a free 
tree? 

(d)  What  is  the  expected  minimum  bandwidth,  for  random  graphs  on  n 
vertices  and  m edges,  as  n and  m — « ? 

Question  (b)  is  of  potential  interest  because  there  seems  to  be  a 
vague  connection  between  efficient  algorithms  for  enumeration  and  efficient 
algorithms  for  testing  existence.  For  example,  there  is  a determinant 
formula  for  evaluating  the  number  of  spanning  trees  of  a graph,  and  there 


are  efficient  algorithms  for  testing  connectedness.  The  problems  of 
enumerating  the  number  of  hamiltonian  paths  of  a graph,  or  the  number  of 
ways  to  satisfy  a given  set  of  clauses,  etc.,  do  not  seem  to  be  in  NP; 
there  most  likely  are  polynomial- time  reducibilities  between  such  problems, 
but  such  transformations  remain  to  be  investigated.  In  the  case  of 
bandwidth-2  layouts  for  a graph,  there  is  a linear  time  algorithm  for 
existence,  yet  no  apparently  "nice"  characterization.  So  this  is  a 
candidate  problem  in  which  enumeration  might  be  definitely  more  difficult 
than  existence. 

i 

Question  (c)  is  suggested  by  the  observation  that  the  stated  problem  is 
solvable  in  polynomial  time  for  m = 1 [31],  but  as  m increases  the 

best  layouts  are  eventually  those  with  minimum  bandwidth. 

All  four  problems  can  be  considered  also  for  the  case  of  directed 
bandwidth. 

Another  interesting  question  is  to  discover  how  far  from  optimum  the 
various  heuristic  methods  for  bandwidth  reduction  can  be;  see  the  references 
below  for  several  approaches  that  have  been  proposed. 
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